Natural-emotion GMM transformation algorithm for emotional speaker recognition
Abstract
One of the largest challenges in speaker recognition is speaker-emotion variability. Compensation techniques are currently the main solution to this problem. These methods, however, require speech in every emotional state to be elicited from each speaker, which is not user-friendly in practice. The basic problem is therefore how to obtain the distribution of a speaker's emotional speech and how to train an emotion GMM from the speaker's natural speech alone. This paper presents a natural-emotion GMM transformation algorithm that trains users' emotion models to overcome this problem. The algorithm converts a natural GMM into an emotion GMM based on an emotion database. It needs only the speaker's natural speech and does not need to align natural utterances with emotion utterances. Performance is evaluated on the MASC database, and promising results are achieved compared with traditional speaker verification.
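For readers unfamiliar with the terms, the following is a minimal sketch of the standard GMM density and of the log-likelihood-ratio score commonly used in GMM-based speaker verification. The notation ($K$, $w_k$, $\mu_k$, $\Sigma_k$, $\lambda_{\mathrm{spk}}$, $\lambda_{\mathrm{bg}}$, $\theta$) is an assumption introduced here and does not come from the abstract. The abstract does not specify the natural-to-emotion transformation itself, so it is not shown; in the setting described, $\lambda_{\mathrm{spk}}$ would presumably be the emotion GMM derived from the speaker's natural GMM.

```latex
% Standard GMM density over a feature vector x (symbols are illustrative assumptions)
p(x \mid \lambda) = \sum_{k=1}^{K} w_k \, \mathcal{N}(x;\, \mu_k, \Sigma_k),
\qquad \sum_{k=1}^{K} w_k = 1, \quad w_k \ge 0

% Average log-likelihood-ratio score for a test utterance X = \{x_1, \dots, x_T\},
% comparing a claimed speaker model with a background model
\Lambda(X) = \frac{1}{T} \sum_{t=1}^{T}
  \Bigl[ \log p(x_t \mid \lambda_{\mathrm{spk}}) - \log p(x_t \mid \lambda_{\mathrm{bg}}) \Bigr]

% Decision rule: accept the identity claim when \Lambda(X) \ge \theta
```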
Related articles
Automatic Speech Emotion and Speaker Recognition based on Hybrid GMM and FFBNN
In this paper we present text-dependent speaker recognition, enhanced by first detecting the emotion of the speaker using hybrid FFBN and GMM methods. The emotional state of the speaker influences the recognition system. A Mel-frequency Cepstral Coefficient (MFCC) feature set is used for experimentation. To recognize the emotional state of a speaker, a Gaussian Mixture Model (GMM) is used ...
Speaker Recognition System Based on the Baseband Correlation Score Reliability Fusion
In emotional speaker recognition, an emotion mismatch between training and testing causes system performance to decline sharply. Normalizing the emotion of the test speech is an important approach to solving this problem. This method proceeds from an analysis of the differences between each kind of emotional speech and neutral speech. Besides, it takes the baseband mismatch of emotiona...
Comparison between GMM-SVM Sequence Kernel and GMM: Application to Speech Emotion Recognition
Speech emotion recognition aims at automatically identifying the emotional or physical state of a human being from his or her voice. The emotional state is an important factor in human communication, because it provides feedback information in many applications. This paper makes a comparison of two standard methods used for speaker recognition and verification: Gaussian Mixture Models (GMM) and...
Emotional Speaker Identification by Humans and Machines
This paper concerns the effect of emotion change on speaker identification by humans and machines. A contrasting experiment is carried out between an Automatic Speaker Identification (ASI) system (applying GMM-UBM and the Emotional Factor Analysis (EFA) algorithm) and the human aural system on the emotional speech corpus MASC. The experimental result is similar to that in channel-mismatched condition, ...
Emotion attribute projection for speaker recognition on emotional speech
Emotion is one of the important factors that degrade system performance. By analyzing the similarity between the channel effect and the emotion effect on speaker recognition, an emotion compensation method called emotion attribute projection (EAP) is proposed to alleviate intra-speaker emotion variability. The use of this method has achieved an equal error rate (EER) reduction of 11.7%...